# Fine-tuned wav2vec2

Wav2vec2 Large Xlsr 53 Th Speech Emotion Recognition 3c
A Thai speech emotion recognition model fine-tuned from airesearch/wav2vec2-large-xlsr-53-th that classifies three emotions: anger, happiness, and calm
Audio Classification Transformers
Paranchai
9
0
Wav2vec2 Turkish Gender Classification
Apache-2.0
A Turkish gender classification model fine-tuned from facebook/wav2vec2-base, trained on the common_voice_17_0 dataset with a test set accuracy of 84.79%
Audio Classification Transformers
candenizkocak
19
1
Wav2vec2 Base ASVSpoof5 TUC N
Apache-2.0
A voice anti-spoofing model fine-tuned from wav2vec2-base, achieving 88.89% accuracy on the evaluation set
Audio Classification Transformers
DavidCombei
20
0
Violence Detect 44
Apache-2.0
An audio classification model fine-tuned from facebook/wav2vec2-base-960h for detecting violent sounds
Audio Classification Transformers
Hemg
28
0
Wav2vec2 Base Gender Classification
Apache-2.0
A fine-tuned voice gender classification model based on facebook/wav2vec2-base, achieving 98.92% accuracy on the evaluation set
Audio Classification Transformers
7wolf
14
1
Wav2vec2 Audio Emotion Classification
Apache-2.0
A fine-tuned audio emotion classification model based on facebook/wav2vec2-base for analyzing emotional states in speech
Audio Classification Transformers
dhanush23
15
0
Wav2vec2 Phenome Based Alffaamharic
Apache-2.0
A wav2vec2-based speech recognition model, fine-tuned at the phoneme level for Amharic
Speech Recognition Transformers
Samuael
34
2
Wav2vec2 Base Down On
Apache-2.0
A binary audio classification model fine-tuned from facebook/wav2vec2-base, specifically designed to distinguish between the pronunciations of 'down' and 'on'
Audio Classification Transformers
MatsRooth
20
0
Wav2vec2 Base Music Speech Both Classification
Apache-2.0
An audio classification model fine-tuned from facebook/wav2vec2-base for distinguishing between music and speech
Audio Classification Transformers
FerhatDk
20
0
Neunit Nihaochangchu V3
Apache-2.0
An audio classification model fine-tuned from facebook/wav2vec2-base on the SUPERB dataset, with 99.99% accuracy
Audio Classification Transformers
SHENMU007
14
0
Bsc Ai Thesis Torgo Model 1
Apache-2.0
A speech processing model fine-tuned from facebook/wav2vec2-base, reported to perform well on the evaluation set
Speech Recognition Transformers
Juardo
19
0
Wav2musicgenre
Apache-2.0
An audio classification model fine-tuned from facebook/wav2vec2-base for music genre recognition
Audio Classification Transformers
ramonpzg
20
0
Voip Classification
Apache-2.0
A speech classification model fine-tuned from facebook/wav2vec2-base on an audiofolder-format dataset
Audio Classification Transformers
james-xie-rng
18
0
Neunit Ks Kangyuan0601
Apache-2.0
An audio classification model fine-tuned from facebook/wav2vec2-base on the SUPERB dataset, achieving 99.87% accuracy on the evaluation set.
Audio Classification Transformers
SHENMU007
16
0
Neunit Ks 529
Apache-2.0
An audio classification model fine-tuned from facebook/wav2vec2-base on the SUPERB dataset, achieving 99.98% accuracy
Audio Classification Transformers
SHENMU007
14
0
Wav2vec2 Base Toronto Emotional Speech Set
Apache-2.0
An audio emotion classification model fine-tuned from wav2vec2-base to identify the speaker's emotional state.
Audio Classification Transformers English
DunnBC22
185
3
Ser Model
Apache-2.0
A fine-tuned speech emotion recognition model based on facebook/wav2vec2-base, achieving 84.71% accuracy on the evaluation set
Audio Classification Transformers
aherzberg
30
0
Is Vinyl Scratched Or Not
Apache-2.0
An audio classification model fine-tuned from wav2vec2-base to detect scratches in vinyl record audio.
Audio Classification Transformers English
DunnBC22
22
1
Wav2vec2 Base Finetuned Coscan Age Group
Apache-2.0
An age group classification model fine-tuned from wav2vec2-base on the COSCAN-speech dataset, achieving 99.8% accuracy on the validation set
Audio Classification Transformers
versae
34
0
Exp W2v2t En Vp Nl S281
Apache-2.0
An English speech recognition model fine-tuned from facebook/wav2vec2-large-nl-voxpopuli on the Common Voice 7.0 training set.
Speech Recognition Transformers English
jonatasgrosman
18
0
Wav2vec2 Final 1 Lm 4
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-base, achieving a word error rate of 0.4499 on the evaluation set
Speech Recognition Transformers
chrisvinsen
16
0
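The word error rate quoted for this model is conventionally the word-level edit distance between the reference transcript and the model's output, divided by the number of reference words, so 0.4499 corresponds to roughly 45 errors per 100 reference words. The following is a minimal sketch of that calculation, assuming plain whitespace tokenization; the function name `word_error_rate` and the sample sentences are illustrative and do not come from the model card.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution, or a match when cost is 0
            ))
        prev = curr
    return prev[-1] / len(ref)


# Illustrative sentences: one substitution ("sat" -> "sit") and one deletion ("the").
score = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(f"WER = {score:.4f}")
```

Because insertions count as errors, the ratio can exceed 1.0 when the hypothesis is much longer than the reference.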
Wav2vec2 Large Xls R 300m Kinyarwanda
Apache-2.0
A Kinyarwanda speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the common_voice dataset
Speech Recognition Transformers
peter2000
13
0
Ai Light Dance Singing Ft Wav2vec2 Large Lv60 V2
Apache-2.0
An automatic speech recognition model fine-tuned from wav2vec2-large-lv60 on the ONSET-SINGING dataset for singing voice recognition.
Speech Recognition Transformers
gary109
16
1
Wav2vec2 Large Xls R 300m Guarani Small Wb
Apache-2.0
An automatic speech recognition (ASR) model based on the wav2vec2-large-xls-r-300m architecture, fine-tuned on the Guarani speech dataset.
Speech Recognition Transformers
jhonparra18
16
0
Wav2vec2 Large Xls R 300m Turkish Colab
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-xls-r-300m on the Common Voice Turkish dataset
Speech Recognition Transformers
Siddique
29
0
Wav2vec2 Base Keyword Spotting
Apache-2.0
A keyword spotting model fine-tuned from wav2vec2-base on the SUPERB dataset, achieving 98.43% accuracy
Audio Classification Transformers
anton-l
14
0
Wav2vec2 Xls R 300m Adult Child Cls
Apache-2.0
A fine-tuned adult-child voice classification model based on facebook/wav2vec2-xls-r-300m, achieving 94.04% accuracy
Audio Classification Transformers
anantoj
48
0
Wav2vec2 Large Xls R 300m My Hindi Home Latest Colab
Apache-2.0
A speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on a general speech dataset.
Speech Recognition Transformers
nimrah
16
0